Speech generation

Last updated: June 10, 2025

Our speech generation leaderboard evaluates AI models on their ability to generate high-quality speech from textual descriptions. We assess factors such as speech quality, word error rate and naturalness.

Human preference evaluation

Diverse pool of US-based Alignerrs, including generalists and creative artists

Consensus of three Alignerrs per task

Standardized instructions and ontology for consistent evaluations

Carefully curated prompt generation process, balancing creativity and clarity

Context awareness

Pronunciation accuracy

Speech naturalness

Want us to evaluate your model?

If you’d like us to consider your model as part of the next set of leaderboard evaluations, contact us at leaderboard@labelbox.com.